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ABSTRACT 

Observations suggest that the effective radii of high-redshift massive spheroids are as 
much as a factor ~ 6 smaller than low-redshift galaxies of comparable mass. Given the appar- 
ent absence of low-redshift counterparts, this has often been interpreted as indicating that the 
high density, compact red galaxies must be "puffed up" by some mechanism. We compare the 
ensemble of high-redshift observations with large samples of well-observed low-redshift el- 
lipticals. At the same physical radii, the stellar surface mass densities of low and high-redshift 
systems are comparable. Moreover, the abundance of high surface density material at low red- 
shift is comparable to or larger than that observed at z > 1—2, consistent with the continuous 
buildup of spheroids over this time. The entire population of compact, high-redshift red galax- 
ies may be the progenitors of the high-density cores of present-day ellipticals, with no need 
for a decrease in stellar density from z — 2 to z = 0. The primary difference between low and 
high-redshift systems is thus the observed low-density material at large radii in low-redshift 
spheroids (rather than the high-density material in high-redshift spheroids). Such low-density 
material may either (1) assemble at z < 2 or (2) be present, but not yet detected, at z > 2. 
Mock observations of low-redshift massive systems suggests that the amount of low-density 
material at high redshifts is indeed significantly less than that at z = 0. However, deeper obser- 
vations will be important in constraining the exact amount (or lack thereof) and distribution 
of this material, and how it builds up with redshift. We show that, without deep observations, 
the full extent of such material even at low redshifts can be difficult to determine, in particular 
if the mass profile is not exactly a single Sersic profile. We discuss the implications of our 
results for physical models of galaxy evolution. 
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1 INTRODUCTION 

Recent observations suggest that high-redshift spheroids may have 
significantly smaller eff ective radii than l o w-redshift analogues 
of the same ma ss (e.g. iDaddi et all 120051: iTruiillo et al.l [ 2006a 
20071: IZirm et all 120071: ; f 



Toft et al. 2007; van Dokkum et al. 2008 



Franx et al.l 120081; Ivan der Wei et alj 120081 : Icimatti et al] 120081 : 



Buitrago et alj|2008l) . The apparent differences are dramatic: the 
inferred effective radii are as much as a factor ~ 6 smaller at fixed 
stellar mass in the most massive galaxies at z = 2. Whatever pro- 
cess explains this apparent evolution must be particular to this class 
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of gal axies: disk galaxies are not similarly compac t at high red- 
shift ( Ravindranath et al. 2004; Ferguson et al. 2004; Barden et al. 
l2005l :l Somerville et al.ll2008l) . As such, these observations repre- 
sent a strong constraint on models of galaxy and bulge formation. 

Relative to the abundance of massive galaxies today, there are 
not a large number of compact systems at high redshift. However, 
even if just ~ 10% survived intact to z — 0, this would greatly 
exceed th e observed number d ensity of such systems in the local 
Universe dTruiillo et all 12009). In fact, at fixed stellar mass, el- 
liptic als with older stellar populations appear to have the largest 



radii dGallazzi et alfeOOfj ; iBernardi et alfcool iGraves et alj2009l: 
Ivan der Wei et al.ll2009l) 

The challenge for both observations and models is therefore 
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to understand how these high-redshift systems could evolve to be- 
come "typical" spheroids today. Their masses, number densities, 
and clustering dictate that they a re the progenitors o f the most mas- 
sive el lipticals and BCGs today dOuadri et alJl2007l ; lHopkins et al.l 
l2007d) . These systems have much larger R e and thus have lower 
effective densities £ e ff = M* (< R e ) / (wRl). However, this does not 
necessarily mean that the physical densities are lower than those 
of the high-redshift systems. One way to increase R e would be to 
uniformly "puff up" the profiles, lowering the physical density ev- 
erywhere. This would imply that the central densities of massive 
high-redshift ellipticals would need to decrease by two orders of 
magnitude from z = 2 to z = 0. Alternatively, R e can change by just 
as much by adding a relatively small amount of mass at low surface 
densities and large radii, without affecting the central density at all. 
In other words, an evolving effective density does not necessarily 
imply an evolving physical density at all radii. 

Buildup of an "envelope" of low-density material is 
expected as massive early-forming galaxies undergo late- 
time (major and minor) gas-poor merger s with later-forming, 
less-d e nse ellipticals, disks , and d warfs ( Gallagher & Ostriker 
19721; lOstriker & Tremaind 1 19751; lHausman & Ostriker] 1 19781; 



Weil & Hernquist 1994, 1996; Naab et al.l20071) . This is a relatively 



efficient channel for size-mass evolution, yielding factor of several 
size evolution with on ly a factor 1.5 — 2 increase in stellar mass 
( Hopki ns et al. 2009d). But this less dense material added at large 
radii does not significantly affect the high-density coreQ and so 
these models predict that the dense, high-redshift systems should 
survive to become the central regions of (some fraction of) today's 
massive ellipticals. 

If, on the other hand, high-redshift systems evolved primarily 
by equal-mass dry mergers between equivalently dense spheroids, 
then this will "inflate" the profiles relatively uniformly. In this ex- 
treme case, effective radii and stellar mass both approximately dou- 
ble in the merger; high-redshift systems would be uniformly more 
dense than their low-redshift desc endants (see e.g. Hernqui st et al.l 
ll993l;lBovlan-Kolchin et alj|2006l) . 

Distinguishing between these possibilities, as well as other dy- 
namical, stellar evolution, or observational effects that could lead 
to apparent size-mass evolution clearly depends on understanding 
in detail differences in the surface density profiles of spheroids as a 
function of radius, at low and high redshift. In this paper, we quan- 
titatively compare low and high-redshift observations to constrain 
these scenarios and inform how systems are evolving from z > 2 to 
z~0. 

In § [2] we directly compare the observed profiles of high and 
low-redshift massive spheroids, and show that, at the same phys- 
ical radii, their stellar surface mass densities are comparable. The 
massive, high redshift systems appear no different than the "cores" 
of today's massive ellipticals. In §[3] we determine the distribution 
of maximum/central densities, and show that this has not evolved 
significantly from z = to z > 2. In § [4] we calculate the mass 
function (and global mass density) of these high-density "cores" at 
both low and high redshift. This allows us to quantitatively com- 
pare the abundance of high-density material observed at both low 
and high redshifts. We show that there is as much or more high- 
density material in the cores of massive spheroids at z — as is ob- 
served to be in place at z — 1—2. The difference between low and 



1 By "core," we refer to the central regions of the galaxy, not to any specific 
class of central profile slopes. We use the phrases "cusp ellipticals" or "core 



ellipticals" to distinguish these. 



high-redshift systems, we conclude, lies in the lack of observed low 
surface-density envelopes around the high-redshift systems. In §|5] 
we show that, although such envelopes are weaker at high redshifts, 
some caveats should be born in mind with respect to the determina- 
tion of the mass in low-density material in high-redhsift spheroids. 
We summarize our results and discuss their consequences for phys- 
ical models of spheroid evolution in §[6] 

Throughout, we a ssume a WMAP5 co smology (Qm = 0.27, 
fU = 0.73, h = 0.705; iKomatsuetalJ 2009), but the exact choice 
makes no significant difference. 



2 SURFACE DENSITY PROFILES OF ELLIPTICALS AT 
HIGH AND LOW REDSHIFT 

Figure Q] shows a direct comparison of the observed surface stellar 
density profiles of high-redshift compact galaxies and low-redshift 
massive galaxies. At lo w redshift we compile obser ved surface 
bright ness profiles from iKormendv et alj (2009) and lLauer et"ai] 
(2007); this consists of a total of ~ 180 unique local ellipticals 
with nuclear HST observations and ground-based data at large radii 
(allowing accurate surface brightness profile measurements from 
~ lOpc to ~ 50 kpc)0The isophotally averaged major axis profiles 
are measured in rest-frame optical; we convert to a stellar mass pro- 
file based on the measured total stellar masses and the assumption 
of a radius-independent stellar mass-to-light ratio. Conversion to 
stellar mass profiles using e.g. color or stellar population gradients 
and comparison of profiles from different instrume nts and wave- 
bands in these samples are discussed extensively in Hopki nTet al.l 
(20093, 120083) : the differences are much smaller than the scatter 
between individual profiles, and do not affect our conclusions. In 
Figure[T we restrict our comparison to massive galaxies with M* > 
10 u Mg|j because these systems are most likely to be descen dants 
of massive high-redshift galaxies. The lKormendv et al.l ( 2009) sam- 
ple is a volume-limited survey of the Virgo spheroid population; as 
suc h it includes few v ery massive galaxies (M» > 3 x 10 1 ' Mq). 
The lLauer etail d2007l) galaxies are chosen to be representative of 
massive ellipticals in the local Universe, including more massive 
systems up to a couple 10 12 Mq. At the masses of interest, both 
are representative of t he distribution o f spheroid sizes in the local 
SDSS galaxy sample dShen et alj|2003h . 

Figure Q]compares the low-redshift sample with the observed, 
PSF de-convolved profiles of nine high-redshift compact massive 
galaxies (Af « > 1O H M0, R e ~ lkpc ), specifically the z ~ 2 — 3 
sample from lvan Dokk umetall (l2008). This is a well-studied sam- 
ple that represents the extreme of implied size evolution: the in- 
ferred average R e is a factor of ~ 6 smaller than local spheroids of 
the same mass. Figure Q] shows the best-fit Sersic profile of each 
galaxy in the sample; stellar mass-to-light ratios are determined 
by assuming a radius-independent M*/L and normalizing the ob- 
served portion of the profile to the total stellar mass determined 



2 Note that although the composite (HST+ground-based) profiles were 
used in lLauer et al 1 120071) to estimate effective radii, they were not actu- 
ally shown in the paper. 

3 Stellar masses for all objects are determined from the combination of rest- 
frame optical and near- IR photometry , correc ted t o an assumed IChabrieJ 
120031) IMF. We refer to lHopkins etafl fe009al) and lKriek etail l2008bl) for 
details of the low and high-redshift samples, respectively. Varying the spe- 
cific bands used to determine stellar masses makes little difference, and 
changing the IMF will systematically change the stellar masses of all ob- 
jects considered, but will not change our comparisons. 
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R 1/4 [kpc 1/4 ] R 1/4 [kpc 1/4 ] 

Figure 1. Direct comparison of the (major axis) surface stellar mass density profiles of z > 2 compact massive spheroids (M„ ~ 10 Mq , best-fit R e ~kpc; red 
dashed) and local, massive ellipticals (M* > 10 11 M Q , R e > A- 5kpc; solid). Both £ and R are in physical units. The high-z profiles are th e PSF de-convolved 
fit, plotted over the observed range in radius given the best-case limitations in seeing and surface brightness (from van Dokku m et al.l2008l) . The low-j profiles 
co mbine space and ground-based photometry to obtain very large dynamic range; they are from the lKormendv et al.l 120091) Virgo elliptical sample (left) and 
the lLauer et all 120071) local massive elliptical sample (right). The former is a volume-limited sample, so contains fewer very massive galaxies with high central 
surface brightness. Although the high-z systems have much smaller R e , their densities at any physical radius are not unusual compared to the local objects: the 
central ~ 1 — 2 kpc of massive ellipticals today are just as dense. The difference in R e owes to the presence of the large wings/envelopes at low surface density 
in the low-redshift objects. 



from photometry and spectroscopy in iKriek et alJ d2006Ll2008bl lah. 
We plot the profile of each system over the maximum radial range 
observed: from the scale of a single pixel at the observed redshift to 
the limiting surface brightness depth of the best images. Both the 
low and high-redshift systems are plotted in terms of major- axis 
radii (a non-negligible correction). 

At low redshift, the stellar mass-to-light ratios of ellipticals 
appear to be nearly independent of radius (reflected in e.g. their 
observed weak color gradients), but the stellar mass-to-light ra- 
tio ma y depend significantly on radius i n the high-redshift sys- 
tems dTrager et af]|200fj| : ICote et aljfcood : ISanchez-Blazquez et all 
120071) . However, based on the observed stellar population gra- 
dients in local ellipticals, the observed ages/colors of the high- 
redshift systems, or the outcomes of numerical simulations, the 
expected variation in M» /L is such that a young, recently-formed 
post-starburst stellar population at the center of the high-redshift 
galaxy will have high er L/M, than older stars at larger R e (see e.g. 
Iriopkins et al. 2008b). This would make the high-redshift systems 
less dense than we assume here; we conservatively allow for the 
maximal stellar mass densities in those systems. 

The comparison in Figure Q] is quite striking: although the 
best-fit effective radii and effective surface densities of the high- 
redshift systems are quite different from their low-redshift ana- 
logues, the actual stellar surface mass densities at any given ob- 
served radius do not appear significantly higher than a substantial 
fraction of the low-redshift population. In other words, inside the 
same observed radii ~ 1 — 5 kpc, many of today's massive ellipti- 
cals are just as dense as the high-redshift systems. The difference in 
effective radius stems primarily from the fact that the low-redshift 
systems have substantial extended wings/envelopes of low surface- 



brightness material (S <C 10 9 MQkpc -2 ); by contrast, the infer- 
ence from fitting the high-redshift systems is that their profiles fall 
more rapidly at large radii (as we discuss further below). 

For the sake of comparison with these and future high-redshift 
observations, it is useful to define the "upper envelope" of low- 
redshift galaxy density profiles. Formally, we can take e.g. the 
+ 1 a ~ 86% contour of the combined sample of density profiles 
shown in Figure [T] but we find that this can be conveniently ap- 
proximated with a simple Sersic function. This envelope is approx- 
imately given by 

£+i CT (z = 0) -4.5 x lO 12 M kpc" 2 e" 1L67 ^'' . (1) 

This is a Sersic profile with n s — 6, R e — 40 kpc, and total stellar 
mass M t = 1.7 x 10 12 Mq. Note that most single galaxies do not 
remain along this envelope over its entire extent. Rather, at each 
radius, this represents the + 1 a upper extent of observed densities 
within the spheroid population (i.e. the most dense systems at each 
radius); Most individual systems approach it over some more lim- 
ited dynamic range. This is a useful comparison quantity in par- 
ticular because, although the mean profiles of z — ellipticals are 
well-known, there has been relatively little parameterization of the 
scatter in profile shapes. Thus, even if high-redshift ellipticals are 
more dense than the median system today, if they do not exceed the 
relation given by EquationQ] then their densities can be accommo- 
dated within some portion of the present-day spheroid population, 
so long as they do not represent a large fraction of the present-day 
abundance of spheroids. As expected, we find that at their centers, 
the z — 2 systems are at most comparable to this upper envelope, 
i.e. comparable to the most dense z = spheroid cores, and at large 
radii, they fall well below the envelope. 



© 0000 RAS, MNRAS 000, 000-000 



4 Hopkins et al. 



3 CENTRAL STELLAR DENSITIES 

Figure|2]plots the peak surface stellar mass densities E pca k obtained 
in both the low and high-redshift galaxy populations. The "maxi- 
mum" or peak surface density must be defined within some radius, 
for galaxies whose surface density continues to rise to unresolved 
radii (e.g. cusp ellipticals; although most of the local massive galax- 
ies are core ellipticals, with relatively flat maximum surface den- 
sities within ~ 50 — 500 pc). For example, the maximum stellar 
surface density can be defined as the average E interior to some 
fixed small radius ~ 50 — lOOpc, or a fixed fraction of R e from 
~ 0.02 — 0.04, or by extrapolation to R — of a best-fit Sersic pro- 
file - we are simply interested in comparing the central densities at 
small radii in both low and high-redshift systems. For each determi- 
nation, we find qualitatively similar results albeit with some small 
normalization differences; Figure [2] shows E pca k determined from 
averaging within 50pc and within 0.02R e . For the high-redshift 
systems, PSF and seeing effects smear out the maximum observed 
surface brightness/density inside ~ lkpc; we extrapolate the best- 
fit Sersic profiles inwards to the same radii as the local systems. 
Making this same approximation in the low-redshift samples shows 
that it is reasonable, but tends to slightly over-estimate the central 
surface density, especially in core ellipticals. 

At both low and high redshift, there is a characteris- 
tic maximum central surface stellar mass density ~ 0.3 — 2 x 
10 n MQkpc~ 2 , with significant, but still surprisingly little scatter 
given the known diversity in the profile shapes of ellipticals (e.g. 
variation in Sersic indices and cusp versus core populations). The 
maximum is similar whether we include ellipticals of all masses 
(~ 10 9 — 10 12 Mq), or restrict to cusp or core populations; although 
dry mergers are expected to transform cusp ellipticals into core el- 
lipticals via "scouring" (ejecting mass from the central regions in 
a BH-BH merger), this primarily affects the mass profile on very 
small scales (comparable to or less than the scales here - well be- 
low what we generally refer to as the "central" regions of ellipticals 
at ~kpc scales). The maximum surface density of the high-redshift 
systems is perhaps a factor ~ 2 larger than that of low-redshift sys- 
tems, but given that we are extrapolating Sersic profiles inwards 
for the high-redshift systems, this is probably an upper limit. Thus 
while the observations imply up to a factor ~ 40 evolution in effec- 
tive surface brightness, there is nowhere near this much evolution in 
the true maximum stellar surface density, a much more physically 
relevant quantity. 

Recently, iBezanson et al. I HI!) reached very similar conclu- 
sions from a similar comparison between high and low-redshift 
profiles, using a different methodology and low-redshift sample. 
One subtle difference between our conclusions and theirs is that 
they claim the central densities of the high redshift systems appear 
higher, on average, than those at low redshifts by a small factor (~ 
a couple). This owes largely to the fact that the authors extrapolate 
the best-fit Sersic profiles inwards to small radii - as we show here, 
this does lead to slightly higher central densities. But we caution 
that, doing the same in the low-redshift systems, we would obtain 
similar slightly higher densities: o ver the observed range (Figure[TJ 
there is no difference. Moreover. IBezanson et al.l d2009h compare 
the high-redshift profiles only with the average profile of similar- 
mass galaxies at low redshift; in Figure Q] and §[4] we consider the 
full distribution of profile shapes at low redshift: even if the high- 
redshift systems have, on average, slightly higher central densities, 
they are still compatible with the central densities of a large fraction 
of the z = massive elliptical population. 

It is worth noting that the mass and redshift-independence in 



Epeak in Figure[2]is somewhat surprising, given the diversity of for- 
mation histories and scatter in e.g. the mass present at larger radii 
(see e.g. lHopkins et alj|2009eh . A more detailed discussion of this 
will be the subject of future work, but it may relate to the maximum 
surface density of gas that can turn into stars (see §[6}. 



4 THE MASS AT HIGH STELLAR DENSITIES 

We now quantify the amount of mass at different stellar surface 
densities. In order to reduce the effects of noise and PSF effects 
(important in particular for the high-redshift systems), we define 
the surface density in this section as the average surface density 
within each radius, i.e. E(2?) = (E(< R)) = M»(< R)/tyR 2 . We 
obtain similar results using the local E, but with larger noise. For 
each observed system in our low-redshift sample, given the stellar 
mass profile (Figure [TJ, we calculate the total fraction of the stel- 
lar mass that lies above a given threshold in surface density E m i„. 
We evaluate this for each system separately, and in Figure [3] we 
plot the average mass fraction at each E for all observed systems in 
our sample, in several bins of total stellar mass. Although the mass 
fraction above each E m j„ can vary by a large amount from galaxy to 
galaxy, the average is surprisingly robust across masses (and does 
not depend significantly on whether we include both cusp and core 
ellipticals or evaluate the two separately). By sufficiently low E m i„ 
thresholds, ~ 10 8 MQkpc~ 2 , essentially all mass in spheroids is 
accounted for; some systems have more extended, lower surface- 
brightness envelopes, but they contribute little total mass. Approx- 
imately ~ 25% of the stellar mass density at each mass remains 
above ~ 10 I0 Mq kpc~ 2 - a typical effective surface brightness for 
high-redshift ellipticals - in moderately high-mass systems (drop- 
ping to ~ 10% by the most massive 10 13 Mq systems). By a thresh- 
old of ~ 10 1 1 Mq kpc~ 2 , we have reached the maximum/peak sur- 
face densities of ellipticals (Figure [2j, and the mass fractions at 
higher densities drop rapidly. 

The best-fit Sersic profiles of the z ~ 2 systems imply that they 
have higher mass fractions above a high surface density thresh- 
old ~ 10 10 Mq kpc~ 2 . However, as illustrated above, this primarily 
owes to their having less mass at low E, not more at high E. 

The comparison between the low and high-redshift samples 
can be made more quantitative by determining the stellar mass 
function above a given surface density threshold. To do so, we ig- 
nore all stellar mass in the Universe below a given threshold in 
E, and construct the spheroid mass function. The mass of a given 
galaxy is only the mass above that E; i.e. we calculate the volumet- 
ric number density of spheroids with E > E m j n 

m v M — djV(galaxies|M, >M ga i) 

«L> ™gal(> EminjJ = — (2) 

as a function of the integrated mass above E m j„, 



Mi=M(> E n 



E X 2nrdr. 



(3) 



The resulting mass functions are shown in Figure [4] In detail, we 
take the observed stellar mass function of spheroids dBell et all 
2003), and at each mass, convolve with t he dis tribu tion of sur- 
face d ensity profiles from lKormendv et alj (2009) and lLauer et al.l 
J2007h for systems of the same mass, to determine the result- 
ing mass function (number density) above a given surface density 
thresholdQ We are assuming that the distribution of profile shapes 



Note that "density" here has two meanings: the volume density (the y- 
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Figure 2. Distribution of peak (maximum) spheroid stellar mass surface densities. Left: Low-z systems from Figure [T] Most ellipticals, at any mass, have 
Speak ~ 0.3 — 1 X l()"MQkpc — 2 . Histograms show two different calculations of S pea ][ (within 50pc and R e /50); dotted lines are the median from each. 
Right: Same, for the high-z samples. Direct observations are PSF and seeing-limited, so we extrapolate the best-fit Sersic profile inwards to obtain E pca k (this 
is typically an upper limit in the low-z samples). The results are striking: central densities are reasonably independent of redshift. 
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Figure 3. Average mass fraction above some threshold surface stellar mass 
density E m j n , for the low-z sample from Figure [T] The values shown are 
the average fraction of all spheroid mass (in a narrow range around each 
M» ) above the given E m j„ . A significant fraction of the z = spheroid mass 
(~ 25%) resides in matter at the inferred effective E of the high-z compact 
systems (10 10 Mq kpc" 2 ). 

in lKormendv et al] d2009h and lLauer et al.l |2007|) are representative 
at each mass. Their samp le selection as well as other measurements 
(see those works and e.g. iTrujillo et all2002l : lFerrarese et al.l2006l : 
lAllenetalj|200r3) and the close agreement in fundamental plane 
correlations and Sersic index distributions in larger volumes (e.g. 
IShen et alj2003h suggest that this is probably a good assumption. 

axis of Figure|4j is the total number of galaxies meeting a given criteria per 
unit volume; the surface density is the local stellar surface mass density of 
stars within a particular galaxy. 
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Figure 4. Spheroid stellar mass function (number density) for mass above a 
given surface stellar mass density threshold (E ra j„). We calculate the z = 
mass function of spheroids including only stellar material in any galaxy 
above the given E m j n , given the total stellar mass function and the ob- 
served distribution of profile shapes. The mag enta dotted line shows the 
total spheroid mass function teell et alJliooa) - most of the mass is ac- 
counted for above 10 9 MQkpc -2 . The value of E m i„ = 10 10 MQkpc -2 
corresponds to the effective surface density of the compact high-redshift 
spheroids; points show the results of the same calculation for the observed 
high-redshif t systems and this value of E m j„. We show this for the z ~ 2.3 
data from van Dokkum et al. 1 2008, black circles) and z ~ 1 data from 
Ivan der Wei et alj i200sl triangles) . 



Unsurprisingly, Figure [4] shows that at lower £ thresholds the 
predicted mass function converges to the total stellar mass func- 
tion of spheroids, i.e. most stellar mass is accounted for. And 
at high thresholds it drops rapidly, especially at high masses: 
10 u MQkpc~ 2 is the peak surface density inside -C lkpc at both 
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Figure 5. The total global mass density of stars which reside in massive 
galaxies (total galaxy mass > 10" Mq) and are above a threshold stellar 
mass surface density (within the galaxy) of E m j n . This is the integral of Fig- 
ure[4] including only massive galaxies. We show the results from observa- 
tions at z = (black), z = 1 (blue) and z = 2 (red; shaded range corresponds 
to typical uncertainties in the total mass density of these compact high-z 
systems). The high-z data is limited to the highest £ directly observed (lim- 
ited by resolution and seeing). At z = 0, ~ 4 X 10 7 Mq Mpc - 3 is locked in 
massive ellipticals above a surface density of > 10 10 MQkpc~ 2 . This is a 
factor of a few larger than the mass density at such £ at z ~ 2. 



low and high redshifts - no observed systems have significant 
amounts of mass > 10" Mq above this threshold. 

In Figure [4] we compare the z = volume density of mass at 
E > 10 10 MQkpc -2 (solid lines) with observational inferences at 
z = 1 (diamonds) and z = 2 (circles). Specifically, given the total 
number density of > 10 10 M^ spheroids/co mpact red systems at 
these redshifts jPerez- Gonzalez et al. 2008 b]), convolved wi th the 
distribution of profile shapes from van Dokku m" et al.l (2008), we 
obtain the number density of objects with mass above the rele- 
vant E threshold. In the error budget in Figure [4] we include th e 
difference in number den sities estimated by Fontana et alj J20061) . 
Ivan Dokkum et all d2006h . and|Marchesini et al. ( 2008), and varia- 
tion in the distribution of profil e shapes/siz e s fitted in other works, 
includi ng rTruiillo et alj j2007l) ; llbft et alj d2007h : iBuitrago et ail 
(2008); ICimatti et alj fe008h . These yield similar conclusions to 
within a factor ~ 2 — 3. At the masses of interest, the observations 
should be reasonably complete to these high surface densities. 

We can also integrate the mass functions in Figure [4] to ob- 
tain the total volume density of stellar mass in spheroids above 
some threshold in surface density E m j n ; this is shown in Figure [5] 
Whereas mergers will not conserve number density, they should 
conserve total stellar mass in this calculation (to the extent that they 
do not change the E m i„ of the central regions of galaxies). Since the 
high-redshift systems are primarily massive, > 10 n Mq, and their 
descendants cannot presumably be much lower mass, we restrict 
this calculation to only systems with a total stellar mass above this 
limit (although this only removes the very lowest-mass contribu- 
tions to the high-E population in Figure [4] and does not substan- 
tially affect our comparison). We then calculate, in systems above 
this mass, the total stellar mass above each threshold E m j n . 

Figure [5] shows that there is ~ 4 x 10 7 MqMpc~ 3 of stellar 
mass in > 10 n Mq ellipticals, above a surface density threshold 
of E > 10 l0 MQkpc~ 2 in the local Universe. This is compara- 
ble to the total stel l ar mass density of high-redshift red spheroids 
( lLabbe et alj|2005l ; Ivan Dokkum et al. 2006]; iGrazian et alj|2007l ; 



lAbraham etafl kOQ?). Convolving over the observed size distribu- 
tion at z ~ 2, the mass density above this threshold E is of course 
lower ~ 1 — 2 x 10 7 Mq Mpc -3 . W e obtain a similar result co mpar- 
ing to the z — 1 observations from Ivan der Wei et alj d2008l) , with 
the relevant constraints being at lower surface density as there is 
less relative evolution. 

Our comparisons indicate that the local Universe contains just 
as much, or more, stellar mass at high surface densities as implied 
by observations of high-redshift systems. It is thus possible that all 
of the high-stellar mass density systems at high redshift can be in- 
corporated into massive ellipticals today, without any conflict with 
their observed number densities or surface brightness profiles. In 
fact, a reasonable amount of high-surface density material must 
continue to be added to the elliptical population, perhaps by gas- 
rich mergers, from high redshifts until z — 0. It is the subject of an- 
other study whether gas-rich mergers produce an adequate amount 
of high-E stellar mass in current galaxy formation models to ac- 
count for the observed growth of in Figure [5] Nonetheless, the 
high density material at high redshifts is not inconsistent with the 
z ~ data and therefore does not have to "go away." 



5 THE MASS AT LOW STELLAR DENSITIES 

Our results demonstrate that the difference between low and high- 
redshift spheroids does not arise in their central densities, but in the 
large envelopes of low surface brightness material observed in low- 
redshift systems. This is the origin of their larger effective radii. 

There are two natural ways of reconciling the low and high- 
redshift observations with the hypothesis that the high-redshift 
spheroids are the progenitors of today's ellipticals. First, the high- 
redshift systems may not have much low-density material at large 
radii; low-density material would then have to be accreted at lower 
redshifts via late-time mergers (minor or major) with gas-poor 
disks and ellipticals (i.e. lower-density systems). Such a scenario 
is feasible, and indeed expected - if the initial spheroid-forming 
mergers are sufficiently gas-rich, there will be little low-density ma- 
terial f rom extended stellar disks to contribute to an extended enve- 
lope JHopkins et al. 2009a). And comparison of clustering prop- 
erties, merger rates, and stellar populations all imply that these 
massive, high redshift systems should grow by a factor ~ 1.5 — 2 
via these channels between z ~ 2 and z = 0, more or less suf- 
ficient to account for the envelope s seen in Figure [p (see e.g. 
van Dokkumll2005l; iBell et alj|2006l ; Izheng et al.lr2007l;lLin et al.l 
20081 ; IConrov & Wechslerj2009l) . 



The second way of reconciling the low and high-redshift ob- 
servations is that high-redshift systems do have material at low sur- 
face densities already, but it is not seen in present observations (in 
such a case, there would be much less than expected of a buildup 
of low-density material). In order to be as conservative as possible, 
in considering the possible degree of envelope buildup since high 
redshifts, we consider this possibility here. 

Figure [6] illustrates circumstances under which the high- 
redshift observations may not be sensitive to extended, low-E 
wings. We consi der a few repres e ntative galaxy profiles from the 
local sample of iKormendv et all d2009l) , ranging from low-mass 
cusp ellipticals with low Sersic indices at large radii (steep sur- 
face density falloff), to high-mass core ellipticals with high Sersic 
indices at large radii (extended envelopes) □ For each, we convolve 



The specific five galaxies shown are (from top to bottom) NGC 4464, 
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Figure 6. Best- fit light profile pa r amete rs R e and n s versus image depth. Left: Observed (V-band) surface brightness profiles (black) for several typical local 
spheroids in the IKormendy et al] fe009l) sampleSfrom low mass (low-nj, i.e. steep light profile decay; top) to high mass (high-H,, i.e. shallow decay/large 
envelope; bottom). These are compared to the profiles constructed from mock images of the same objects with best-case imaging available at z ~ 0.5 — 2 (cyan 
diamonds; seeing/PSF FWHM of 1 .2 kpc, and depth/dynamic range of £i m ax — /imin = 4.5 mag; arrows in bottom left). Dotted cyan line shows the best-fit 
(convolved) Sersic profile fit to the mock high-z imaging data, extrapolated to large radii. Center: Effective radii of the best-fit Sersic profile, relative to the 
true R e , as a function of image depth. We construct a series of mock profiles with similar PSF, seeing, noise, and pixel size (as left), but vary the image 
depth Afi = fi m ax — Mmin- We fit the resulting mock profile (with each A/i) with either a free Sersic index n s (red diamonds) or fixed n s = 4 (black circles). 
Dotted vertical line shows the Afi of the mock profile at left. Right: Corresponding best-fit n s (for free-n, fits). At low masses (little stellar envelope), there 
is little dependence on the image depth. At high masses (large-envelope systems) a strong trend appears, because the z = profiles of massive galaxies are 
fundamentally not well-described over a large dynamic range by single Sersic laws. At small R < 2 — lOkpc (A/i ~ 2 — 4), the radii that dominate the fit in 
all but the deepest local imaging data, the profiles show a steeper falloff (indicative of n s < 4, weak-envelope profiles) and yield a smaller best-fit R e . Only at 
R > 20 — 50kpc (Afi ~ 6 — 8) do the low-density wings appear, leading to larger n s and R e . 



NGC 4515, NGC 4473, NGC 4365, and NGC 4552. The first three are 
classified as cusp elliptical s, the latter two as co re ellipticals. The ob- 
servations are described in IKormendy etafl (2009), but typically include 
~ 80 — 100 photometric points from a few pc to ~ 50 kpc in radii; photo- 
metric errors are < 0.04 mag arcsec" 2 (not visible in Figure[6}. They have 
stellar masses of M* = (0.17, 0.13, 1.2, 3.9, 2.0) X 1O U M , true effec- 
tive radii - fit from the full data with proper multi-component profiles - of 
Re = (0.60, 1.05, 3.19, 14.6, 10.6) kpc, and central velocity dispersions of 
a = (120, 90, 192, 271, 252)kms _l . Fitting their outer profile shapes to 



the observed profile with a simple Gaussian PSF with typical best- 
case resolution for the high-redshift observations of interest (1 a = 
0.5 kpc; FWHM 1.2 kpc). We then fit the (one dimensional) pro- 
file to a single Sersic profile (fitting appropriately convolved model 
profiles). We do this as a function of image depth. Specifically we 



Sersic profiles (where they are uncontaminat ed by the central, high surface- 
density components; see e.g. Hopkins et al. 2009a d) yields best-fit outer 
Sersic indices of n s = (2.1, 3.9, 4.6, 7.1, 9.2). 
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plot the results of the fits as a function of A/i = /i raax — /i m i n , the 
surface brightness (magarcsec -2 ) of the deepest point included rel- 
ative to the central/maximum surface brightness of the convolved 
image. In Figure[6]we plot the resulting best-fit effective radii, con- 
sidering both fits with a free Sersic index n s and a fixed n s = 4. For 
fits with a free n s , we plot the corresponding best-fit n s . 

For low-mass and intermediate-mass galaxies, which have low 
n s and therefore fall off steeply in E at large R, the true profiles 
can be recovered with even relatively shallow observations. If any- 
thing, for the lowest mass-galaxies, which have Sersic profile in- 
dices n s ^2 — 3 (when fit in thi s manner to a single Sersic index; 
see e.g. iBalcells et al]|2007bl lal: I Ferrare se et al. 2006), the effects 
of the PSF tend to slightly increase the inferred effective radius 
in shallow images. For systems very close to n s — 4, characteristic 
of intermediate-mass galaxies, there is almost no depth-dependent 
bias to the inferred R e and n s (see NGC 4515 in Figure |6). The 
same conclusions are reached in the a nalysis of mock-re d shifte d 
SDSS images of low-mass galaxies in Ivan derWel et a l. (2008). 
In short, if there is not a pronounced low-surface brightness en- 
velope, then whether or not the low-surface brightness data is well- 
observed makes little difference to the fitted properties. Again, we 
emphasize, for normal ~ L» ellipticals, there appears to be no bias, 
and the relevant parameters are recovered well via simple Sersic 
fits, even given a very limited dynamic range of fitting. 

However, in the most extreme systems characteristic of the 
most massive galaxies - those with large outer Sersic indices in- 
dicative of extended envelopes that include significant low-density 
material out to ~ lOOkpc radii - the inferred R e and n s can be sen- 
sitive to whether or not this material is included in the fits. Consid- 
ering the entire sample, we find that the typical depth required to 
obtain a "converged" R e and n s (to within ~ 30 — 50% of the value 
obtained with the deepest available data) is a strong function of stel- 
lar mass (really, the strength of the low-density envelope), rising 
from Afi ~ 4 mag in intermediate-mass systems to Afi ~ 6 — 8 in 
high-mass (M» > 10" Mq) systems. If we require that at least 50% 
of local systems have converged R e , we obtain the approximate 
mass-dependent criteria Afi com > 3.0 + 2.3 log (M./IO 11 Mq); if 
we raise our desired threshold to ~ 75 — 90%, the minimum Afi 
should be uniformly deeper by another 1 .5 mag. In terms of physi- 
cal radii, depth of Afi ~ 4 mag corresponds roughly to a maximum 
well-sampled, high S/N physical radius of ~ 5 kpc; the depth re- 
quired for obtaining converged R e in massive systems (6 — 8 mag) 
corresponds to physical radii > 20 — 50 kpc. 

This bias arises because the systems with large envelopes are 
not perfect Sersic profiles; indeed, it is now well-established that 
no populations of spheroids are completely describe d by single 



Sersic profiles given sufficient dynamic range ( s ee e.g. Kormendvl 



19991; I Ferrarese et al 



2006; ILauer et alj 120071 ; IC6t6 et al.l 2007; 



Kormendv et alj |2009). As a consequence, although a single Ser- 
sic profile is often a formally good fit (in a \ 2 sense) over limited 
dynamic range, the best-fit n s can change systematically as that dy- 
namic range changes. The effect is most pronounced in systems 
with the most dramatic envelopes; at small radii R < 2 — lOkpc, 
systems are either "concave down" (which corresponds to a local 
Sersic index n s < 4) or uncurved (n, = 4). At larger radii, the pro- 
files become more "concave up" (n s > 4). It is important to stress 
that, as such, the "correct" Sersic index is fit for the dynamic range 
sampled in the simple examples shown. If, in fact, the high-mass 
systems shown in Figure[6]were perfect Sersic profiles, there would 
be no significant effect. Ultimately, this reflects the obvious caveat 
to any limited dynamic-range observation: a fit is being extrapo- 



lated to radii not directly observed, based on some assumed func- 
tional form. 

We emphasize that Figure [6] is intended to be purely illustra- 
tive; we are not attempting to construct a specific comparison with 
or calibration for any individual observed sample. The specific cal- 
ibrations for these observations are different sample to sample, and 
well outside the scope of this paper - we refer to the relevant ob- 
servational papers for more details. For example, the real fit results 
will also depend on the resolution, instrument PSF, sky subtrac- 
tion, and other details. Given the non- trivial dependence of pro- 
file shape curvature on radius in the most extreme systems, the 
results will depend just as much on the relative weighting (error 
bars as a function of radius) in the observed profile as on any 
choice of a "cutoff" radius (recall, we simply truncate the pro- 
file at some limit - realistically, this will appear as some surface- 
brightness-dependent error bar). Moreover, in many works (e.g . 
iTruiillo et al]2007l ; Ivan Dokkum et alj2008l : iBuitrago et aHl2008f) . 
the best-fit Sersic profile is determined directly from a fit to the two- 
dimensional image data , assuming a (radius-independent) elliptic - 
ity. In other works (e.g. iKormendv et"al]|2009l ; ILauer et alj|2007f) . 
the image data is used to produce a major-axis or circularized pro- 
file (which allows for e.g. variations in ellipticity with radius or 
isophotal twists) and then fit to a Sersic profile. The attendant sys- 
tematics, at this level of detail, are not identical; and re-fitting the 
objects shown in Figure[6]in two-dimensional images tends to sup- 
press the run of Sersic index with fitted radius (owing to the differ- 
ence in relative error weighting; C. Peng, private communication). 
The consequences of those details can, in principle, affect the in- 
ferred sizes in either direction, not just towards inferring smaller 
sizes in lower-depth observations. 

That being said, we can consider these caveats in the context 
of the deepest available observations at z ~ 1—2, spa nning An ~ 
3 — 4 mag (S/N rapidly decreasing at R 5 kpc ; see lTruiillo et al.l 
l2004l ; ICimatti et al.ll2008l ; lDamianov et al.ll2009h . At these depths, 
our comparisons suggest that - if the "true" profiles were identical 
to those of today's most extreme massive galaxies (again, systems 
with ~ 100 kpc envelopes and very large n s ~ 8 — 10- the effective 
radii could be under-estimated by factors up to ~ 2 — 3. It is not, 
in principle, hard to recover fitted R e ~ 2 — 3 kpc for such systems 
in this simplified experiment, for individual objects given limited 
dynamic range. Note, however, that the details are sensitive to the 
issues above: there is less of a discrepancy (in at least this idealized 
case), for example, comparing the two-dimensional profiles. More 
importantly, even allowing for this level of an effect, it is very diffi- 
cult to account for the entire evolution observed; in short, there is a 
real deficit of low density material at large radii in the high-redshift 
systems. This is already apparent at the lowest £ sampled, in e.g. 
Figure Q] where the high-redshift profiles are falling more rapidly 
than those of low-redshift analogues. 

Again, if there is no large envelope as in the extreme case con- 
sidered here, then there will be no bias in R e in high-redshift obser- 
vations. But of course, this becomes circular - the point is simply 
that caution is warranted extrapolating any profile (especially pro- 
file shapes calibrated to the observations of low-redshift galaxies) 
to radii not directly observed. 

O ther, simple tests can constrain these possibilities. For ex- 
ample, |Zirmet_alJ q2007|) stack ~ 14 high-redshift (z > 2) systems 
identified as compact (gaining roughly an additional Afj, ~ 1.5), 
and see no significant change in Sersic profile or size (relative to the 
luminosity- weighted average size of the individual fits). Likewise , 
stacking the z = 1 compact systems in Ivan der Wei et al. I ( 120081) . 
the best-fit Sersic index is n s ~ 4, comparable to the individual 
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fit results. These appear t o support what can a lready be seen in 
Figure Q] (and Figure 1 of iBezanson et alj 2009); at the maximum 
radii ~ 5 kpc sampled in the high-redshift observations, the profiles 
do appear to be falling more rapidly with radius than profiles of 
similar-mass low-redshift galaxies. 

It appears, therefore, despite the caveats above, that there is a 
real deficit of material at low surface densities at large radii around 
massive, high-redshift galaxies. 

However, it is important emphasize two simple points this 
highlights with respect to the interpretation of observed surface 
density profiles. First, some caution is always warranted when the 
dynamic range is limited. Additional checks, such as stacking the 
observed images, calibrating against low-redshift samples, and test- 
ing for bias in surface brightness limits of the sample, are important 
(and indeed have been a critical component of many of the high- 
redshift studies considered here). Given the lack of strong con- 
straints on the amount of material at very low surface densities at 
high redshifts, the effective radii of the most extreme systems could 
(again, depending on the circumstances as noted above) be biased 
at the factor ~ 2 level. This is insufficient to explain the total ob- 
served evolution - there appears to clearly be some real evolution 
in the amount of low surface density material - but it points to the 
still relatively large uncertainties in just "how much" low density 
material is (or is not) already in place at z — 2. Constraints on this 
quantity, in particular as a function of redshift, will be of consider- 
able interest for constraining models of how low-density envelopes 
build up from high redshifts to today. 

Second, it should be borne in mind that real ellipticals are not 
always perfect Sersic profiles, and in some cases the best-fit pro- 
file shape can change depending on the dynamic range sampled. 
As such, direct comparison of pr ofiles, such as that considered here 
or in e.g. IBezanson et all {20090 are important (as opposed to just 
comparison of fitted quantities such as Sersic index and effective 
radius). Moreover, inferences made from extrapolation of Sersic 
fits should always be treated with some caution: the approach ap- 
pears to work reasonably well with normal, ~ L, galaxies, for in- 
ferring simple quantities such as the effective radius, but it ulti- 
mately depends on the assumption that the functional form of the 
Sersic profile is a good description of the "true" profile at radii 
where the direct observational constraints are not as strong. For ex- 
ample, the approach of many works attempting to infer whether 
or not significant bias is present, by mock imaging systems mod- 
eled as single Sersic profiles (artificially redshifting and imaging 
them, then re-fitting to a Sersic profile), is a useful exercise, but 
does not necessarily capture all of the physical possibilities. Even 
at low redshifts, in fact, only a small fraction of massive ellipticals 
have effective radii that can be directly determined from the ob- 
servations (i.e. converged R e from the actual observed light profile, 
independent of fitting or extrapolation, that do not change as deepe r 
radii are sampled; see e.g. the discussion in iKormendv etai]2 009). 
Extending these samples, in particular at low and intermediate red- 
shifts, is important both for studying the low-density envelopes that 
build up at later times and for informing our interpretation of the 
high-redshift observations. 



6 DISCUSSION 

6.1 Can Yesterday's Compact Spheroids Be the Cores of 
Today's Ellipticals? 

We have compared the stellar mass profiles of high-redshift, "com- 
pact" massive spheroids and well-studied local massive ellipticals. 



There has been considerable debate in the literature regarding the 
origins and fate of the compact high-redshift systems: they appear 
to be smaller (R e ~ 1 kpc at ~ 10" Mq) than all but a tiny fraction 
of local, similarly massive galaxies which have R e ~ 4 — 5 kpc at 
Z = 0. However, comparing the surface stellar mass density profiles 
directly, over the ranges that are actually well-sampled by observa- 
tions, we show that the observed high-redshift systems have surface 
density profiles similar to the inner, dense regions of local massive 
ellipticals (Figure^. 

In other words, although the effective stellar mass surface den- 
sity within R e , £ e ff = 1 /2M* / (n R?), is large in high-redshift sys- 
tems, the physical stellar surface densities are comparable to the 
typical central surface densities observed at radii ~ 0.5 — 5 kpc in 
many local ellipticals. The centers or cores of local spheroids are 
just as dense as those of high-redshift systems: the difference is in 
the effective radii and effective densities, owing to the large, ex- 
tended wings/envelopes of low surface brightness material around 
local massive spheroids. This material leads to a larger R e and lower 
effective surface density in the local systems. 

We have further shown that the distributions of the maximum 
stellar surface densities are nearly the same at z ~ 2 and z = (Fig- 
ure O - at small radii today's ellipticals have similar maximum 
nuclear stellar surface densities of ~ 0.3 — 1 x 10 1 1 Mq kpc -2 over 



a wide range in stellar mass from ~ 10 S M 







to > 1O 12 M . The 



high redshift systems have their central surface densities smeared 
out by PSF and seeing effects, and thus do not reach these den- 
sities at any observed point; but extrapolating their best-fit Sersic 
profiles inwards they exhibit similar peak surface de nsities. Simi- 
lar conclusions are reached bv lBezanson et alj d2009h as well (note 
that the factor ~a couple apparently higher densities those authors 
note in the central regions of high-redshift systems depends on ex- 
trapolating Sersic profiles to smaller radii than observed, as well 
as ignoring the scatter in the central densities of ellipticals today). 
High-redshift red galaxies are thus not uniformly more dense; in- 
deed, the maximum/peak surface density of spheroids does not ap- 
pear to evolve significantly from z > 2 to z ~ 0. 

Using a large sample of local, high-dynamic range observa- 
tions, we have constructed a census of the local spheroid popula- 
tion and have quantitatively calculated the number of systems with 
central/core mass densities above a given surface mass density and 
stellar mass threshold (Fi sures t 1 5b . We have used this to construct 
the stellar mass function of spheroid "cores" - i.e. the stellar mass 
function of the parts of today's ellipticals that lie above a given 
surface stellar mass density threshold. 

The regime of particular interest is E ~ 10 10 Mq kpc -2 , which 
corresponds to the effective surface brightness of the high-redshift 
compact systems (10 1 1 Mq with R e — 1 kpc). We find that ~ 25 — 
35% of the stellar mass density in z = massive spheroids lies 
above this surface density. Typical ellipticals have cores containing 
~ 1 — 5 x 10 10 Mq above this threshold. Comparing this to the ob- 
served properties of massive galaxies at z = 1 and z = 2, we find 
that by both number and total stellar mass, all of the high-redshift, 
compact systems can be accounted for in the cores of today's el- 
lipticals. For example, even in the extreme case in which every 
Z = 2, 10 11 Mq or larger spheroid (space density ~ 10~ 4 Mpc~ 3 ) 
had R e = 1 kpc, this would correspond to the same space density of 
systems with > 1 /2M* = 5x 10 10 Mq above the effective surface 
density 10 10 MQkpc~ 2 . At z — 0, the space density of such mas- 
sive, high surface density cores is a factor ~ 1.5 — 2 higher. Doing 
the calculation more properly (convolving over the mass function 
and distribution of profile shapes), there is a factor ~ 2 more mass 
in local massive cores than is present at z > 2; the difference is 
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qualitatively similar, but smaller, comparing to z = 1 populations. 
Not only can the high-redshift systems be accommodated (rather 
than being destroyed), but high density material in the centers of 
ellipticals may continue to build up even at relatively low redshifts. 

6.2 How Does this Relate to Physical Models? 

These conclusions are of considerable importance for physical 
models of spheroid formation and evolution, and in particular for 
the models that have been proposed to explain both the formation 
of high-redshift, apparently compact galaxies and their evolution 
into local z = systems. 

Models for spheroid formation naturally predict that ellipti- 
cals and bulges are fundamentally two-component objects, with a 
dense, central core built by dissipational processes - the loss of 
angular momentum in a progenitor gas disk, which then falls to 
the center and turns into stars in a compact starburst - and an ex- 
tended, lower-density envelope build by dissipationless processes 
- the violent relaxation of progenitor disk stars, observed to be at 
much lo wer phase-space densities than the compact cor es of el- 
lipticals dMihos & Hernquistlll994l ; iHopkins et alj|2008al) . Obser- 
vations in the local Universe have confirmed much of this picture 

i it -] | - 1 1 

and made it increasingly robust (Kormendy 1999; Hibbard & Yun 

1 19991 : iRothberg & Joseph! 1 2004 I Kormendy et alj|2009r) . Indeed, 
simulations by several independent groups consistently find that 
it is not possible to make realistic ell ipticals without the appro- 
priate mix of these two components (Barnes & Hernquist l 1 19961: 
Naab et alJ|2006l:ICox etZll200d : IOnorbe et alj|2006l : ljesseit etaU 
2007; Burkert et al. 2008). As such, the existence of dense cores in 
spheroids at both low and high redshifts is a natural consequence 
of dissipational spheroid formation. 

It is possible, if mergers are sufficiently gas-rich^ to build just 
the high-density core, and to add the envelope in relatively gas- 
poor mergers at later times. Given the gas-richness of high-redshift 
galaxies, some size evolution is naturally predicted in models, with 
high-redshift sys tems being more dom i nated by the dense, d issipa- 
tional remnant (Khochfar & Silk 2006; Hopkins et al. 20093). 

The high-redshift observations represent an ideal opportunity 
to catch such cores "in formation" and strongly constrain their 
physical origin. Today, such cores are typically extremely old: 
~ lOGyr. At hig h redshifts, however, they have ages < 500 Myr 
dKriek et al. 2006). Understanding their stellar populations, metal- 
licities, kinematics, and densities is critical to inform models of 
how dissipation builds the central regions of galaxies. There ap- 
pears to be a natural link between the observed compact red galax- 
ies and bright sub-millimeter galaxies, which are intense starbursts 
with consistent number de nsities (accounting for their short duty 
cycles) and physical sizes dTacconi et alj200 6; Younge r et alj2007L 
l2008l : ICimatti et alj2008l) . This c lass of SMGs is widely believed to 
be the product of major mergers iShapiro et alj20 08; Tacco nTet al.l 
2008). Establishing further connections between these populations 
would not only enable new tests of the merger hypothesis, but 
would also rule out alternative models (e.g. monolithic collapse) 
for spheroid core formation. 

The maximal mass densities of spheroids at both low and high 
redshift (~ 10 1 1 Mq kpc~ 2 ), for example, may inform models of 

6 We exclude cases where the systems are gas-dominated from this dis- 
cussion, as in these cases the relevant physics leads to qualitatively dif- 
ferent behavior in mergers, and will not nec essarily make spheroids at all 
iRobertson et alj20 06: Hopkins et al. 200931). 



star formation and feedback in extreme environments. This max- 
imum surface density is intriguingly similar to previous sugges- 
tions of maximal (Eddington-limited) starbursts: if the observed 
ma ss surface density is initially pure gas, forming stars according to 
the lKennicutj d 19981) relation (giving ~ 2500 Mq yr" 1 kpc -2 ), this 
implies a luminosity = 1.5 x lO 13 Z,0kpc~ 2 . This is the Edding- 
ton limit for dusty systems ( Th ompson et alj 2005). And, interest- 
ingly, it corresponds to the maximum SFR surfac e density in dense 
SMGs dTacconi et al1l2006l : I Younger et alj|2008l) . The fact that so 
few ellipticals scatter above this peak surface density also suggests 
that their centers may have formed in a few dissipational events - 
if Eddington-limited arguments explain these peak densities, then 
there is no reason why they could not be exceeded if the gas "trick- 
led in" at a lower rate or in several smaller events. Clearly, it is of 
interest to investigate constraints on star formation and feedback 
models stemming from this. 

Understanding the evolution in profile shapes, in particular 
how central versus outer densities evolve, is necessary to constrain 
how the potential and binding energy at the centers of spheroids 
evolve. This may be intimately related to the BH-host galaxy cor- 
relations in feedback -regulated models of BH growth (see e.g. 
Hopki ns et all l2007blR and references therein). There has been 
considerable debate regarding the state of the BH-host correlations 
at these redshifts: better understanding of spheroid cores that dic- 
tate the local potential depth is critical to inform theoretical models. 

Current observations suggest that a large fraction of the high- 
density material in spheroids was assembled at early times. Al- 
though we have shown that it is possible to accommodate the 
mass in dense, high redshift cores in the elliptical population 
today, the observations suggest that of order half the massive 
cores of today's massive ellipticals had to be in place by z > 2. 
Compare this to just ~ 5% of the total spheroid mass density 
in place at these reds hifts, and ~ 20% of the massive galaxy 
(> 10 n M^) density dGrazian et all 120071 ; IPerez-Gonzalez et al.1 
2008b; Marchesini et alj|2008l) . In other words, it appears that the 
massive cores of today's ellipticals assembled preferentially early. 
This is qualitatively consistent with models of dissipational forma- 
tion, but in semi-analytic models explaining early mass ive elliptical 
formation is quantitativel y quite challenging (see e.g. Bo wer et al.l 
2006: lFontana et alj2 006). The observations thus constrain not just 
the total assembly, but how this takes place - invoking early minor 
mergers or gas-poor processes, for example, might be able to ac- 
count for the shape of the mass function, but would not explain the 
early formation of dense cores. 

The observational comparison here favors models in which 
high-redshift compact galaxies are not destroyed (as has typically 
been concluded), but accrete or reveal previously "hidden" ex- 
tended envelopes of low surface-brightness material. Their central 
densities remain, but with the appearance of low-density material, 
the effective radii and profile shapes quickly become comparable 
to massive galaxies today. If the high redshift systems genuinely 
do not have such low-density envelopes, the required evolution is 
only a factor ~ 1.5 — 2 growth in stellar mass from high redshifts to 
today. Indeed, dry mergers onto such massive, early-forming sys- 
tems are cosmologically inevitable, and the mass growth require- 
ments found her e are consistent with the current stellar mass func - 
tion constraints dBrown et al . 2008; Perez-Gonzal ez et al.l 2008a). 
Moreover most dry mergers in such massive systems will be with 
later-forming, less massive and less dense systems, that are not ex- 
pected to disrupt the dense cores, but will inste ad build up an env e- 
lope of lower surface density material (see e.g. lNaab et alj2 009). 

In fact, the observations allow the possibility that the central 
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regions of some massive ellipticals today have evolved with little 
or no strong perturbation, and corresponding change in densities, 
since high redshifts, while their envelopes assemble over this same 
period in time. This is important for the kinds of mergers that influ- 
ence bulge formation: mergers with the kinds of low-density sys- 
tems needed to build up an extended, low-density envelope (minor 
mergers, mergers with relatively gas-poor disks, and mergers with 
later-forming, less dense ellipticals) will not significantly perturb 
the (much more dense) central regions of the galaxy. In detail, even 
equal mass mergers with equal density systems do significantly less 
to lower than central densities than a simple energetic argument 
would imply: the central density profile remains relatively unper- 
turbed while energy ten ds to be preferentially transferred to less 
bound outer material (see lHopkins et alj2009d) . 

Such mergers may also transform an initial central "cuspy" 
profile into a "cored" profile, via the scouring action of a binary 
BH-BH merger (if the initial high-redshift systems are "cuspy," ex- 
pected if they have just formed in gas-rich mergers, then they must 
become cored by low-redshift to correspond to observed systems ; 
see e.g. iFaber et alj|l997l : iGraham et af] 120031 : ICote et all [20071 : 
iKormendv et al.lr2009l) . Both minor and major dry mergers are ex- 
pected to be efficient mechanisms for scouring, even where the sec- 
ondary is sufficiently low density s o as not to perturb the d ensity 
profile as discussed above (see e.g.lMilosavlievic et al. 2002]). But 
the cusp-core distinction (in all but the most extreme systems) af- 
fects the mass profile only on very small scales (well below the 
scales observed in all but the nearest systems) and involves only a 
very small fraction of the galaxy mass (~M BH ,or~ 10~ 3 M„). 

6.3 Observational Tests and Future Prospects 

Observed high-redshift systems require the growth of extended, 
low surface density envelopes to match the profiles of massive el- 
lipticals today. It is quite possible that such envelopes are entirely 
absent at high redshifts. However, Figure[6]highlights the fact that 
it remains difficult to say precisely how much of a "mass deficit" at 
large radii must be accreted from z = 2 to z — 0. In short, the total 
mass at very low surface densities at high redshifts remains more 
uncertain than the mass budget at high surface densities. And even 
where observations detect light at relatively low surface bright- 
ness, the relevant S/N weighting means that the profile fits can 
be preferentially weighted towards the high-brightness central re- 
gions. As a result, there could remain some non-trivial differences 
in the best-fit R e from high-redshift observations, compared to deep 
low-redshift observations of analogous systems. 

At low and intermediate stellar masses, where Sersic indices 
tend to be low (n s < 4), our simple experiments, as well as those 



used therein (see e.g.lZirm et alj|2007 


:lTruiillo et alj|2006bll200l 


Ivan der Wei et al. 2008: Cimatti et al. 


2008), suggest that there is 



no strong dependence on the depth of the observations. In other 
words, when there is no significant envelope, the precise mass pro- 
file of the low-density material makes little difference. At the high- 
est masses, however - i.e. local systems where the outer Sersic in- 
dices are very large (n s > 6) because of the presence of large stel- 
lar envelopes that can extend to > lOOkpc scales - there can be a 
non-trivial mass at extremely low brightness, whose profile is dif- 
ficult to recover in detail. If massive-envelope ellipticals such as 
NGC 4552 were present at z > 0.5, full recovery of their envelopes 
(in the sense of recovering high S/N profile information) would 
require observations ~ 4 — 5 mag deeper than typical of individ- 
ual high-redshift objects (Figure |6). Even at low redshifts, only a 



small fraction of objects at the highest masses have directly mea- 
sured and converged effective radii (determined purely from the 
observed light profile); in other cases the effective radius is usually 
estimated by extrapolation of a best-fit Sersic profile to low den- 
sities and large radii. It is possible, if envelopes are present, but 
do not follow a perfect Sersic profile (or if the Sersic profile shape 
of the galaxy profile changes from small to large radii), to under- 
estimate the envelope mass, and correspondingly to under-estimate 
the true effective radius by a factor ~ 2. 

This has important consequences even at low redshifts. Some 
recent studies have claimed that most of the apparent size-mass 
evolution in massive galaxies occurs at very low redshifts, a fac- 
tor of ~ 2 — 3 change in sizes between z = 0.1 — 0.3, with the 
evolution from z = 0.4 — 2 restricted to a smaller factor ~ 1.5 — 2 
( Bernardi 2009; Ferreras et al. 2009). Such extreme apparent low- 
redshift evolution (as opposed to the more plausible high-redshift 
evolution) may be related to the observed dynamic range and fit- 
ting: it is almost impossible to explain such a large change in 
the true stellar half-mass radius in any model over a narrow low- 
redshift interval z = 0.1 —0.3. Independent constraints have clearly 
established that in this redshift range, there is almost no evolu- 
tion in the stellar mass function of massive spheroids, nor is there 
significant evolution in the (uniformly old) stellar populations, 
or any significant stellar mass l oss or new star formation given 
the o l d stellar population ages ( Thomas et al. 2005; Nelan et al. 

m 



20051: iGallazzi et alj|2006l : iMasiedi et alj|2006l ; iBorch et alj|2006.. 
Jones et alj l2006h . Likewise, there is no change in their kine- 



matics or the fundamental plane relation in clusters or the field 
jTreu et alj2005tldi Serego Alighieri et alfeOoHlvan der Wei et al.l 
l2005t I van Dokkum & van der Marelll2007b . Even the dark matter 
halos grow by only a tiny fraction over the redshift range of z = 0. 1 
to 0.3 (< 0.1 dex; almost all accreted at large radii). What do 
change over this redshift range, however, are the spatial resolution 
and effective surface brightness limits, and as a consequence the 
radial range of the profile sampled in observations. We find that the 
observations of rapid changes in the best-fit R e at low redshifts can 
be accounted for by the biases summarized in Figure[6] And indeed, 
other observations, using somewhat different fitting methodology 
and sample selection, have found no such significant evolution at 
low redshifts , while the high-redshift evolution appears more ro- 
bust ( see e.g. lMcIntosh et alj2005l ; lTruiillo et al.l2007l ; lFranx et al.l 
l2008h . 

Clearly, better constraints on just how much low-surface 
brightness material is present, as a function of redshift, will im- 
prove our understanding of elliptical galaxy formation. Whether 
there has been a great deal of evolution in the amount of low- 
density material, or relatively little, either result is of considerable 
interest. If high-redshift systems have essentially no stellar mass 
at low densities (E < 10 9_ '°Mq kpc -2 ), then mergers forming 
spheroids must initially be very gas-rich, and mergers with lower- 
mass galaxies or evolved disks at high redshift must be relatively 
rare. Moreover, this would determine how much material would 
have to be built up by subsequent mergers, putting requirements on 
models for tidal destruction and minor mergers as well as later dry 
major mergers. If some envelopes are present at high redshifts - 
perhaps not as much as at low redshifts, but still non-negligible in 
mass - then it would strongly limit the amount of growth, accre- 
tion, and dry merging such systems could undergo at lower red- 
shifts. To the extent that envelopes exist that contain significant 
mass not previously detected, it would increase the stellar masses of 
these galaxies, perhaps yielding tension with constraints from e.g. 
the galaxy mass function, and certainly presenting a challenge for 
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models, which have difficulty explaining very early mass assembly 
without much subsequent growth. 

There are a number of observational means to study these pos- 
sibilities. Most directly, observations can probe whether such ma- 
terial exists at high redshifts. In addition to profile fitting, which 
necessarily relies on the assumption that some functional form will 
represent a good approximation to light at radii where the profile 
shape cannot be strongly constrained directly, it should be pos- 
sible to estimate that total light contribution from comparison of 
deep integrations of the total light/stellar mass in fixed physical an- 
nuli. Also, stacking the observed high-redshift systems, allowing 
for a gain of ~ 1 — 1.5 mag in surface brightness depth, seems to 
show a continued steep falloff in surface density (n s < 4), indicat- 
ing that there is not a large undetected mass in e nvelopes at high 
redshift fcirm et alfeOOllvan der Wei et al.ll2008l) . This reinforces 
the conclusion that the envelopes must be predominantly built up at 
lower redshifts; unfortunately, it remains difficult to say precisely 
how much material is at low densities (other than to say that it is 
not a large fraction of the galaxy mass and cannot change R e at 
the level of evolution seen), or what the exact mass profile of the 
low-density material is (e.g. whether it corresponds to the inference 
from extrapolating the fitted Sersic profiles, cuts off more steeply, 
or obeys a power law-like distribution analogous to observed intra- 
cluster/halo light in low-redshift systems). 

Observations of major and minor dry merger rates offer com- 
plimentary constraints on how much material is added to these sys- 
tems between high redshifts and today. In the local universe, study- 
ing the properties of these envelopes can inform models of their 
formation histories. For example, stellar population gradients and 
kinematics might reflect a more dramatic transition in properties 
if the envelopes form by late accretion onto earlier-forming cores. 
If the envelopes form early, they will be metal poor and have dif- 
ferent Qf-element abundances compared to late-accreting disk/outer 
bul ge material. Some efforts have been made along these lines (see 
e.g. lSanchez-Blazquez et al .120071) . and gradients in e.g. a-element 
abundances show significant diversity, suggesting that in some sys- 
tems, envelopes formed quickly (perhaps in e.g. later-forming el- 
lipticals, where a large envelope can be formed from relatively gas- 
poor disks), while in others, envelopes are contributed by systems 
with more extended star formation (e.g. late mergers with lower- 
mass or later-forming systems, as expected for systems described 
here). However, samples remain small, and the differences are sub- 
tle; larger samples are needed to correlate the behavior of these 
properties with other aspects of galaxies (e.g. galaxy mass, kine- 
matics, cusp/core status, environment) that might provide an in- 
dication of whether or not they are descendants of similar high- 
redshift systems. Moreover, typical observations of these quantities 
extend only to relatively small radii (~ R e ); it remains difficult to 
probe stellar populations in the low-density outer regions that con- 
stitute spheroid envelopes. Together, however, these observations 
offer promising avenues towards constraining these observationally 
challenging, low-surface brightness components. 

Corroborating evidenc e for the picture presente d here has re- 
cently been presented in ICenarro & Truii llo (2009). The authors 
measure the average velocity dispersions of compact spheroids at 
z — — 2, and show that it evolves weakly relative to e.g. the 
naive expectation of models with uniform contraction/expansion 
(that a 2 oc M*/R e ). Despite factor ~ 6 evolution in effective radii 
of massive galaxies over this interval, the median a at fixed mass 
evolves by a factor ~ 1.3. Moreover, the high-redshift a values are 
in fact consistent with the high end of the observed low-redshift 
scatter at this mass, or with the median a of slightly more massive 



(factor ~ 2) low-redshift ellipticals. This is precisely the behavior 
expected if the central, high-density regions (which determine the 
central potential depth and correspondingly a) assemble preferen- 
tially early, and evolve relati vely unperturbed to lo w redshift (for 
examples in simulations, see Hopk ins et alj|2009ch . Low-density 
material accreted later is clearly important for the effective radii, 
and changes e.g. the dark matter fraction within R e as well as the 
profile shape, but has very little effect on the velocity dispersion. 
The observations represent a strong constraint on the physical den- 
sities at fixed radius: if the central densities at high redshift were in 
fact substantially higher than those at low redshift, by even a factor 
of a few, it would significantly over-predict the observed evolution 
in velocity dispersions. 

Throughout this paper, we have neglected several additional 
physical effects that might affect estimates of the stellar surface 
mass densities and effective radii of high-redshift ellipticals. For 
example, at low redshifts, stellar mass-to-light ratios are indepen- 
dent of radius to within ~ 20% in optical bands in massive ellip- 
ticals, which tend to be uniformly old and have w eak color gradi- 
ents (see e.g. lFaber et al. ll989l ; lBenderetal.ll993l) . However, given 
the observed stellar population gradients run backwards in time, or 
considering local recent merger remnants and/or simulations, the 
expectation is that this could be quite different at the high red shifts 
when the ellipticals formed l Schweizeill 19961 ; iRothberg & J oseph 
l2004l ; lYamauchi & GotdlajolT The young, post-starburst popula- 
tion (ages < 1 — 2 Gyr) in the core has higher optical L/M* , which 
can lead to a rest-frame 6-band best-fit R e that is a factor ~ 1.5 — 2 
smaller than the stellar-mass R e (or the S-band R e observed when 
the system has aged and this effect vanishes). Some reassurance, 
however, comes from the fact that the size-mass relation does not 
seem to depend strongly on the precise stellar population age/colors 
(sp ecifically within the "quenched" or "star-form ing" populations; 
see lPerez-Gonzalez et alj|2008d ; lToft et alj|2007l) and the fact that 
the observed sizes of the ga laxies of interest her e are similar in rest- 
frame near-UV and optical dTruiiilo et al .120071) . There may also be 
some bias in estimated stellar masses of systems at similar ages , 
owing to the uncertai n contribution of AGB stars dMarastor]|2005l ; 
iMaraston et alj [2006b . This is estimated to be a possible factor 
~ 1.2 — 1.4 over-estimate of the high-re dshift masses where high - 
quality optical photometry is available dWuvts et alj|2007l . [2009 ) . 
We have conservatively ignored these effects, but if present they 
will strengthen most of our conclusions. To test for these effects, 
it is important to obtain deeper spectra and photometry, to test for 
the presence of blue cores and constrain the contribution of stellar 
populations of different ages (and stellar population gradients) in 
systems at intermediate and high redshifts. 
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